ProFAL: PROtein Functional Annotation through Literature

Authors

  • Francisco M. Couto
  • Mário J. Silva
  • Pedro Coutinho
Abstract

We introduce ProFAL (PROtein Functional Annotation through Literature), a new information system for the automatic annotation of biological databases using bioinformatics methods. Each annotation is a (gene-product, functional property) pair that associates the attributes of a gene-product stored in the database with a functional property. The system retrieves documents related to each gene-product from online databases and extracts functional properties from their text. To validate these annotations, ProFAL implements heuristics based on a measure of correlation between annotations that we have introduced. To verify the validated annotations, ProFAL also provides a dedicated interface for manual curation. We evaluate the implementation and performance of ProFAL in a case study.
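
The retrieve, extract, and validate steps described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the function names, the plain substring matching, and the `min_support` threshold are hypothetical, and the support count is only a simple stand-in for the correlation measure, which the abstract does not define. It is not ProFAL's implementation.

```python
from collections import defaultdict


def extract_annotations(documents, property_terms):
    """Return (gene_product, functional_property) pairs found in the text.

    documents: dict mapping a gene-product identifier to a list of texts
    (assumed to have been retrieved already from online databases).
    property_terms: iterable of functional-property names to look for.
    Matching is a naive case-insensitive substring test (an assumption).
    """
    annotations = set()
    for gene_product, texts in documents.items():
        for text in texts:
            lowered = text.lower()
            for term in property_terms:
                if term.lower() in lowered:
                    annotations.add((gene_product, term))
    return annotations


def property_support(annotations):
    """Count the distinct gene-products annotated with each property."""
    gene_products = defaultdict(set)
    for gene_product, prop in annotations:
        gene_products[prop].add(gene_product)
    return {prop: len(gps) for prop, gps in gene_products.items()}


def validate(annotations, min_support=2):
    """Keep annotations whose property is shared by enough gene-products.

    Placeholder heuristic only: it stands in for the paper's correlation
    measure, which is not specified in the abstract.
    """
    support = property_support(annotations)
    return {(gp, prop) for gp, prop in annotations if support[prop] >= min_support}


if __name__ == "__main__":
    docs = {
        "GP1": ["This enzyme shows hydrolase activity on xylan."],
        "GP2": ["Hydrolase activity was measured."],
        "GP3": ["Reported transferase activity."],
    }
    terms = ["hydrolase activity", "transferase activity"]
    extracted = extract_annotations(docs, terms)
    print(sorted(validate(extracted)))
```

In a workflow like the one the abstract describes, the pairs that survive such a filter would then go to a curator for manual verification.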

Similar resources

Literature Based Functional Annotation of Genes

This paper proposes a fast and automatic functional annotation method for hundreds of unannotated genes in biological databases. Genes are often described and discussed in biomedical literature that can be used to unravel the functions of genes. We have proposed a reliable text mining approach to discover functional annotations for genes from literature. This approach was validated by building ...

Functional Annotation of Two Hypothetical Proteins Reveals Valuable Proteins Involved in Response to Salinity: An in silico Approach

With the exponential growth in the determination of protein sequences and structures by genome sequencing and structural genomics approaches, there is a growing demand for valid bioinformatics methods to define the function of these proteins. In this study, our objective is to identify the function of unknown proteins from UCB-1 pistachio rootstock and specify their class...

FragKB: Structural and Literature Annotation Resource of Conserved Peptide Fragments and Residues

BACKGROUND: FragKB (Fragment Knowledgebase) is a repository of clusters of structurally similar fragments from proteins. Fragments are annotated with information at the level of sequence, structure and function, integrating biological descriptions derived from multiple existing resources and text mining. METHODOLOGY: FragKB contains approximately 400,000 conserved fragments from 4,800 represent...

Detection of Protein Catalytic Sites in the Biomedical Literature

This paper explores the application of text mining to the problem of detecting protein functional sites in the biomedical literature, and specifically considers the task of identifying catalytic sites in that literature. We provide strong evidence for the need for text mining techniques that address residue-level protein function annotation through an analysis of two corpora in terms of their c...

Grounding annotations in published literature with an emphasis on the functional roles used in metabolic models

Accurate genome annotations in databases are a critical resource available to the scientific community for analysis and research. Inaccurate and inconsistent annotations exist as a result of errors generated from mass automated annotation, and currently act as a barrier to the application of bioinformatics. The purpose of this effort was to improve the SEED by improving the connection of functi...

Publication date: 2003